A Log-energy Scaling Normalization Scheme for Robust Speech Recognition

نویسندگان

  • Hung-Shin Lee
  • Hung-Bin Chen
  • Berlin Chen
چکیده

The log-energy parameter, as an auxiliary but influential feature, has been commonly used to augment Mel-frequency cepstral coefficients (MFCCs) to improve the recognition accuracy in automatic speech recognition (ASR). In this paper, a new and effective scaling approach named log-energy scaling normalization (LESN), which utilizes special nonlinear scaling functions on noisy speech data for log-energy normalization, is investigated. The scaling function is contrived conceptually from the relationship between clean and noisy data. In the experiments carried out with the Aurora-2 database, LESN is shown average word error rate (WER) improvements of 37.53%, 43.93% and 9.66% for Test Sets A, B, and C, respectively, when compared with the baseline processing. The results also show that LESN outperforms other similar approaches, such as log-energy dynamic range normalization (ERN). Furthermore, as well as being integrated with cepstral mean and variance normalization (CMVN), LESN is further applied to the LVCSR task and a considerable gain is achieved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

An energy normalization scheme for improved robustness in speech recognition

The log energy parameter has long been used as an extension to the basic cepstral feature vector in speech recognition. The use of a normalization technique for the log energy parameter has also been widely accepted. In this paper, a simple energy normalization scheme is introduced that allows direct use of the frame energy parameter in speech recognition and performs well in the presence of no...

متن کامل

Silence energy normalization for robust speech recognition in additive noise environment

The energy parameter has been widely used as an extension to the basic features of mel-frequency cepstral coefficients (MFCCs) to improve the recognition accuracy in speech recognition. In this paper, a simple and effective approach for energy normalization for silence (non-speech) portions in an utterance is proposed. This approach, named as silence energy normalization (SEN), uses the high-pa...

متن کامل

Normalization of speaker variability by spectrum warping for robust speech recognition

This paper examines techniques for normalization of unseen speakers in recognition. Two implementations of linear spectrum warping were examined: time domain resampling and filter bank scaling. It is shown that for seen speakers, the models trained by unwarped utterances are less sensitive to spectrum warping by filter bank scaling than by resampling. A pitch-based scheme for warping factor est...

متن کامل

Mon.O2a.05 Sub-band Based Log-energy and Its Dynamic Range Stretching for Robust In-car Speech Recognition

Log energy and its delta parameters, typically derived from full-band spectrum, are commonly used in automatic speech recognition (ASR) systems. In this paper, we address the problem of estimating log energy in the presence of background noise (usually resulting in a reduction in dynamic ranges of spectral energies). We theoretically show that the background noise affects the trajectories of th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007